Corpus: pol_news_2008_30K


2.2.5 Most frequent word beginnings

The most frequent word beginnings, taken as word-initial character n-grams for N = 1 to 5, together with a Zipf diagram of their frequencies.
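A minimal sketch of how such counts could be obtained, not the method actually used for this corpus. The function name `prefix_counts` and the input word list are hypothetical. It assumes that every word of length N or more contributes its first N characters, that each occurrence in the list is counted once, and that counting is case-sensitive (the tables list "Prze-" separately from "prze-").

```python
from collections import Counter

def prefix_counts(words, max_n=5):
    """Count word-initial character n-grams for n = 1..max_n.

    Assumption: a word contributes a prefix of length n only if it has
    at least n characters; case is preserved, so "Prze-" != "prze-".
    """
    counts = {n: Counter() for n in range(1, max_n + 1)}
    for word in words:
        for n in range(1, max_n + 1):
            if len(word) >= n:
                # Trailing hyphen mirrors the notation used in the tables.
                counts[n][word[:n] + "-"] += 1
    return counts

# Hypothetical toy input, not taken from the corpus.
sample = ["przed", "przez", "przy", "po", "Przed"]
counts = prefix_counts(sample)
for n in range(1, 6):
    # most_common(5) gives the top five prefixes of length n with their counts.
    print(n, counts[n].most_common(5))
```

Whether the corpus statistics count running words or distinct word forms is not stated in the source; passing a list of distinct word forms instead of running text would switch the sketch between the two.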


[Figure: Zipf's diagram for word beginnings (Gnuplot diagram)]
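A Zipf diagram plots frequency against rank on logarithmic axes; if the data follow a power law, the points fall roughly on a straight line. The sketch below shows one way to prepare such points and to estimate the slope with an ordinary least-squares fit. The function names `zipf_points` and `loglog_slope` are hypothetical, and the fit is an illustration only, not the procedure behind the diagram on this page.

```python
import math

def zipf_points(frequencies):
    """Return (log10 rank, log10 frequency) pairs, rank 1 = most frequent."""
    ranked = sorted(frequencies, reverse=True)
    return [(math.log10(rank), math.log10(freq))
            for rank, freq in enumerate(ranked, start=1)]

def loglog_slope(points):
    """Least-squares slope of log frequency against log rank."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx

# Frequencies of the five most frequent initial characters from the table
# "Top Characters"; five points are far too few for a reliable estimate.
top_characters = [8783, 4798, 4736, 4565, 3534]
print(loglog_slope(zipf_points(top_characters)))
```

For an ideal Zipf distribution, where frequency is proportional to 1/rank, the slope would be -1.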

Top Characters
rank  frequency  n-gram
1     8783       p-
2     4798       w-
3     4736       z-
4     4565       s-
5     3534       o-
Top Character Bigrams
rank  frequency  n-gram
1     3414       po-
2     3392       pr-
3     2623       za-
4     2291       wy-
5     1362       na-
Top Character Trigrams
rank  frequency  n-gram
1     2315       prz-
2     959        nie-
3     751        pod-
4     715        roz-
5     591        pro-
Top Character 4-Grams
rank  frequency  n-gram
1     1537       prze-
2     768        przy-
3     197        Prze-
4     173        dzie-
5     157        niep-
Top Character 5-Grams
rank  frequency  n-gram
1     242        przes-
2     178        przed-
3     160        przek-
4     134        przew-
5     134        przec-
539 msec needed at 2018-03-19 21:04